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(54) System and method used in a computer system for distributing tasks between 
multiprocessing I/O subsystems 



(57) A load balancing system and method for dis- 
tributing a task between Input/Output subsystems of a 
computer system. The load balancing system includes 
a central processing unit for transmitting the task to a 
central controller. The central controller in response to 
the transmitted task generates and transmits a request 
command over a side-band bus to a plurality of periph- 
eral components. Each of the peripheral components 



that are capable of servicing the request command will 
transmit a response signal over the side-band bus back 
to the central controller. Thereafter, the central controller 
by processing all the response signals dispatches a 
service command signal to one of the capable periph- 
eral components. The service command signal will di- 
rect the one capable peripheral component to service 
the task. 
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D script ion 

[0001] The present invention relates to a system and 
method for distributing tasks between multiprocessing 
I/O subsystems of a computer system and, in particular, 
to a system and method using side-band signaling to 
provide low overhead load balancing among dedicated 
I/O processors. 

[0002] A computer system having multiple central 
processing units (CPUs) that are capable of operating 
in a cooperative fashion on several tasks at one time is 
referred to as a multiprocessing system. The multiproc- 
essing system is often incorporated within equipment 
such as desktops, servers or high-end workstations. 
[0003] Currently, one of the most significant design 
challenges associated with the multiprocessing system 
is the routing and processing of tasks. The multiproc- 
essing system generates one task in response to receiv- 
ing an interrupt, where the interrupt can be character- 
ized as an asynchronous event occurring independently 
of any task currently running and requiring the attention 
of one of the CPUs. The processing of the task or action 
taken by one of the CPUs in response to receiving the 
interrupt is commonly known as "servicing" or "handling" 
the interrupt. 

[0004] Traditionally, the multiprocessing system will 
balance or distribute the interrupts and corresponding 
tasks among the various CPUs to increase system per- 
formance. More specifically, the load balancing or dis- 
tribution of tasks is accomplished in a manner where the 
least busy CPU is used to service the interrupt. The least 
busy CPU will generally handle the interrupt in the least 
amount of time by using a dedicated bus (e.g., Ad- 
vanced Programmable Interrupt Controllers (APIC) 
bus). The conventional techniques for load balancing or 
distributing tasks among CPUs in multiprocessing sys- 
tems include SyslemPro, APIC, OpenPIC and Distribut- 
ed OpenPIC. However, the conventional load balancing 
techniques are laden with software overhead resulting 
in the delay of data transfers and reducing the amount 
of application processing the CPUs can accomplish. 
[0005] To reduce the workload on the CPUs, an alter- 
native I/O system commonly known as Intelligent Input/ 
Output (120) System has been developed in the com- 
puter industry. The Intelligent I20 System is particularly 
advantageous in that it off-loads portions of work (e.g., 
interrupt/tasks) onto intelligent I/O subsystems, which 
frees the CPUs to handle more critical tasks. The intel- 
ligent I/O subsystems include dedicated I/O processors 
(lOPs) for performing details such as interrupt handling, 
buffering and data transfers. However, there is no indus- 
try standard for the load balancing or distribution of tasks 
at the level of the intelligent I/O subsystems. 
[0006] Accordingly, there is a need for a system and 
method for distributing tasks between intelligent I/O sub- 
systems of a multiprocessing system. There is also a 
need for a system and method having a dedicated side- 
band bus to provide low overhead load balancing be- 



tween multiple I/O processors within an Intelligent I20 
System Additionally, a system having a dedicated side- 
band bus is needed to improve scalability. These and 
other needs are satisfied by the load balancing system 

5 and method of the present invention. 

[0007] The present invention is a method and load 
balancing system for distributing a task between input/ 
output subsystems of a computer system. The load bal- 
ancing system includes a central processing unit for 

to transmitting the task to a central controller. The central 
controller in response to the transmitted task generates 
and transmits a request command over a side-band bus 
to a plurality of peripheral components. Each of the pe- 
ripheral components that are capable of servicing the 

is request command will transmit a response signal over 
the side-band bus back to the central controller. There- 
after, the central controller by processing all the re- 
sponse signals dispatches a service command signal to 
one of the capable peripheral components. The service 

20 command signal will direct the one capable peripheral 
component to service the task. 
[0008] A more complete understanding of the method 
and apparatus of the present invention may be had by 
reference to the lollowing detailed description when tak- 

2S en in conjunction with the accompanying drawings 
wherein: 



FIGURE 1 is a block diagram of a computer system 
incorporating a first embodiment of a load balancing 
system in accordance with the present invention; 
FIGURE 2 is a block diagram of the computer sys- 
tem incorporating a second embodiment of the load 
balancing system; 

FIGURE 3 is a signaling diagram illustrating a run- 
time operation on a sxle-band bus of the load bal- 
ancing system shown in FIGURES 1 and 2; and 
FIGURE 4 is a simplified flow diagram of an opera- 
tion of the toad balancing system shown in FIG- 
URES 1 and 2. 
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[0009] Referring to the Drawings, wherein like numer- 
als represent like parts throughout FIGURES 1-4, there 
are disclosed two embodiments of an exemplary load 
balancing system 50, which can be used to implement 

45 the present invention. 

[0010] Although a computer system 100 embodying 
the load balancing system 50 of the present invention 
will be discussed based on the intelligent 120 specifica- 
tion, it should be understood that the intelligent 120 

so specification is only one of many standards and archi- 
tectures having I/O subsystems that may utilize the prin- 
ciples of the present invention. Accordingly, the load bal- 
ancing system 50 described should not be construed in 
such a limited manner. 

ss [0011] Referring to FIGURE 1, there is illustrated a 
block diagram of the computer system 1 00 incorporating 
a first embodiment of the load balancing system 50. Ex- 
cluding the load balancing system 50, the general archi- 
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tecture of the computer system 100 is based substan- 
tially on the intelligent I20 specification, which is hereby 
incorporated by reference into this specification. 
[0012] For clarity, certain details associated with the 
intelligent I20 system including, for example, the split 
driver model, the Integrated Real Time Operating Sys- 
tem and the Peer-to-Peer communication scheme are 
well known in the industry and as such are not described 
herein. Therefore, the description provided below in re- 
lation the intelligent I20 system describes only the basic 
components that interact with the load balancing system 
50. 

[0013] The conventional intelligent I20 architecture 
includes one or more central processing unit(s) 102 
communicating across a host bus 104 with a North 
Bridge or Host : to-PCI bridge 106, and a memory con- 
troller 110 coupled with memory 108. The intelligent 1 20 
architecture also has a plurality of Input/Output Proces- 
sors (lOPs) 112, including a first IOP 114 and a second 
IOP 116 both of which communicate with the Host-to- 
PCI bridge 106 across a peripheral component interface 
(PCI) bus 118. A host operating system 120 generally 
controls the operation of the CPUs 102. 
[0014] The first IOP 114 may incorporate a first pe- 
ripheral component 122, and the second IOP 116 may 
incorporate a second peripheral component 1 24 and a 
third peripheral component 126. The second IOP 116 is 
considered multi-functional, because of the presence of 
two peripheral components 1 24 and 1 26; of course, any 
of the lOPs 112 may be multi-functional by having more 
than one peripheral component. Peripheral components 
122, 124 and 126 can be any type of PCI base adapter 
including, for example, a Local Area Network (LAN) con- 
troller, a Small Computer Systems Interface controller 
(SCSI) or a Redundant Array of Independent Drives 
controller (RAID). In addition, the individual lOPs 112 
are considered PCI based adapters or discrete input/ 
output subsystems in computer systems having archi- 
tectures different than that specified by the Intelligent I/ 
O (I20) Architecture specification. 
[0015] For illustrative purposes, the peripheral com- 
ponent 122, 124 and 126 are designated as LAN con- 
trollers coupled to independent LAN buses 1 28 for con- 
trolling communications. Each of the LAN buses 128 
supports a plurality of LAN components 1 32. Of course, 
either of the peripheral components 122, 124 or 124 
may be any type of PCI base adapter such as a SCSI 
controller for supporting a multiple of SCSI components 
(not shown). Although the second IOP 116 is illustrated 
as supporting two peripheral components 124 and 126, 
again it is understood that each IOP may support any 
number and type of peripheral component. 
[0016] The load balancing system 50 is used within 
the computer system 100 for distributing a task to one 
of the lOPs 112 or peripheral components 122, 124 or 
126. In general, the task is generated within one of the 
CPUs 102 in response to receiving an interrupt, and 
then the task is transmitted to one of the lOPs 112 for 



4 

processing to off-load a portion of work from the CPUs. 
The exemplary load balancing system 50 is discussed 
in greater detail below. 

[0017] During initialization or Power On Self Test 

5 ("POST") of the computer system 100, the startup con- 
figuration software scans the PCI bus 118 or buses to 
determine what PCI devices 112 exist and what config- 
uration requirements they may have. This process is 
commonly referred to as enumerating, scanning, walk- 

10 jng or probing the PCI bus 118. The process may also 
be referred to as the discovery process. The software 
program which performs the discovery process may be 
referred to as a PCI bus enumerator. The PCI device 
enumeration is done via PCI bus transactions to scan a 

is PCI configuration register 140 within the peripheral 
components 1 22, 1 24 and 1 26 to determine which of the 
peripheral components are configured to participate in 
the load balancing scheme in the first place. The periph- 
eral components 122, 124 and 126 configured for toad 

20 balancing are then assigned an unique response iden- 
tifier (RID), for example, the peripheral components 
122, 1 24 and 1 26 may have RIDs 1 42 with logical values 
of "0", "1 B and "2", respectively. The PCI bus enumerator 
also scans each of the PCI configuration registers 1 40 

25 to determine a maximum number of queued tasks 144 
that can be stored or queued at any given time within a 
peripheral queue 146 of each peripheral component 
122, 124 and 126. 

[0018] Furthermore, the PCI configuration registers 
30 140 include a configuration space having a task capa- 
bility register, a maximum queue depth register and a 
current queue depth register 

[0019] After initialization of the computer system 100, 
a central controller 148 coupled to the Host-to-PCI 

35 bridge 1 06 receives the task distributed from one of the 
CPUs 102. The central controller 148 uses a side-band 
bus 149 to communicate with the peripheral compo- 
nents 122 ; 124 and 126. The central controller 148 in- 
cludes a configuration table 150 having a set of 64-bit 

40 wide registers that are indexed by the RIDs 142, each 
register stores a current number of queued tasks 152, 
the maximum number of queued tasks 144, and a bit 
pattern 154. In addition, the configuration table 150 in- 
cludes a configuration space containing a maximum 

45 number of queueable or pending tasks register, a cur- 
rent number of tasks register and a task capability reg- 
ister. 

[0020] Each bit pattern 154 is representative of a 
class or group of tasks that a corresponding peripheral 

50 component 122, 124 or 126 is capable of servicing for 
the load balancing system 50. For example, if the first 
peripheral component 122 was configured to handle a 
particular task then one of the bits would be set to '1 " in 
the bit pattern 1 54; in contrast, if the first peripheral com- 

55 ponent was not configured to service the particular task 
then the one bit would be set to "0". 
[0021] Furthermore, each of the 64-bit wide registers 
may have a form such that the lower 32-bits are re- 
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served for storing the bit pattern 154, and the upper 32 
bits are evenly divided for storing the current number of 
queued tasks 152 and the maximum number of queued 
tasks 144. Of course, the configuration table 150 may 
have different forms besides the 64-bit wide registers 
described above. 

[0022] The central controller 148 also includes a proc- 
essor 156 for masking out the peripheral components 
122, 124 or 126 that are not capable of servicing the 
transmitted task by comparing the respective bit pattern 
1 54 to the task that is to be serviced next. The processor 
156 then transmits a request command (e.g., multicast 
signal) to each of the capable peripheral components 
1 22, 1 24 or 1 26. Assuming, the second peripheral com- 
ponent 1 24 was configured for SCSI operations instead 
of the illustrated LAN operations then the second pe- 
ripheral component would not be capable of servicing 
the task : and as such would be masked out by the proc- 
essor 156. 

[0023] Upon receiving the request command, the ca- 
pable peripheral components 122, 124 or 126 transmit 
a response signal 1 47 over the side-band bus 1 49 to the 
central controller 148. More specifically, the response 
signal 1 47 is transmitted by a sub-controller 1 58 located 
within each peripheral component 122, 124 and 126. 
The sub-controller 158 may be referred to as an asym- 
metric multiprocessing I/O processing with I20 capabil- 
ity (AMIOPIC) sub-controller. The sub-controller 158 al- 
so includes a queue counter 160 that operates to keep 
track of the current number of queued tasks 1 52 pres- 
ently located within the peripheral queue 146. 
[0024] For example, if the task is distributed to the first 
peripheral component 1 22, then the respective sub-con- 
troller 1 58 will place the task within the peripheral queue 
146 and increment the queue counter 160 by one. Like- 
wise, the queue counter 1 60 will be decremented by the 
sub-controller 158 when any of the queued tasks have 
been completed and/or removed from the peripheral 
queue 146. 

[0025] The response signals 147 transmitted by the 
sub-controllers 158 of the capable peripheral compo- 
nents 122, 124 or 126 are received and processed by 
the central controller 148. In processing the response 
signals 147, the central controller 148 will arbitrate and 
dispatch a service command signal to one of the capa- 
ble peripheral components 1 22, 1 24 or 1 26. The service 
command signal will direct the one capable peripheral 
component 122, 124 or 126 to service the task. More 
particularly, the processor 156 compares all of the cur- 
rent number of queued tasks 152 to the corresponding 
maximum number of queued tasks 1 44 currently stored 
in the configuration table 150 to determine the one ca- 
pable peripheral component 122, 124 or 126. The one 
capable peripheral component 122, 124 or 126 has a 
least number of current tasks relative to the maximum 
number of queued tasks 144. 

[0026] In the event all of the capable peripheral com- 
ponents 122, 124 and 126 have the current number of 



queued tasks 1 52 greater than or equal to the maximum 
number of queued tasks 144, then the central controller 
1 48 will place the task on a central queue 1 62. The task 
will remain on the central queue 1 62 until anyone of the 

s capable peripheral components 122, 124 or 126 be- 
comes available, thereafter, the task or service com- 
mand signal is distributed to that particular peripheral 
component. The capable peripheral component 122, 
124 or 126 is considered available when the current 

10 number of queued tasks 152 falls below the maximum 
number of queued tasks 144. 

[0027] As mentioned earlier and now described in 
more detail, the side-band bus 149 includes multiple da- 
ta lines (d0-d3), a control (CTRL) line and a clock (CLK) 

15 line having a predetermined frequency. The frequency 
(e.g., 25-66 MHZ) of the clock line is generally synchro- 
nous to the speed or frequency of the PCI Bus 118. The 
side-band bus 1 49 includes a predetermined number of 
pins (e.g., six are shown) for allowing a scalable proto- 

20 col, which will be discussed in more detail with reference 
to FIGURE 3. The side-band bus 149 may be referred 
to as an asymmetric multiprocessing I/O processing 
with 120 capability (AMIOPIC) bus. 
[0028] Also mentioned earlier was the fact that each 

25 of the configured peripheral components 122, 124 and 
126 are assigned the unique RID 142 and, more specif- 
ically, the unique RIDs define when and on which data 
line (d0-d3) each of the peripheral components will re- 
spond to queries by the central controller 148. For ex- 

30 ample, the second peripheral component 124 is as- 
signed the unique RID 1 42 having a value of "1 " and as 
such the second peripheral component will utilize the 
data line (d1), CLK and CTRL lines for communicating 
with the central controller 148. Similarly, the third periph- 

35 eral component 1 26 will use the data line (d2), CLK and 
CTRL lines for communication purposes (e.g., response 
signal), because the third peripheral component is as- 
signed the unique RID 142 having a vafue of "2". 
[0029] For illustrative purposes, the peripheral queue 

40 146 of the first peripheral component 122 has three 
tasks that are queued to be processed by the first pe- 
ripheral component, where the current number of 
queued tasks 1 52 having a value of three in binary form 
"01 1 " is transmitted to the central controller 1 48 by one 

45 of the response signals 147 on the data line (dO). Like- 
wise, the second peripheral component 124 has one 
task currently queued, where the current number of 
queued tasks 152 having a binary form "001 " is trans- 
mitted by another of the response signals 147 on data 

so line (d1). Lastly, the third peripheral component 126 has 
five tasks that are currently queued, where the current 
number of queued tasks 1 52 having a binary form "1 Oi- 
ls transmitted by yet another of the response signals 147 
on data line (d2). In the present example, the second 

ss peripheral component 1 24, will r ceive the task or serv- 
ice command signal from the central controller 148, be- 
cause the second peripheral component has the least 
number of tasks currently awaiting to be serviced, as- 
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suming all of the maximum number of queued tasks 1 44 
have the same value. 

[0030] Reference is now made to FIGURE 2, where 
a second embodiment of the load balancing system 50 
is illustrated using prime referenced numbers. The load 
balancing system 50' is similar to the first embodiment 
except for the form of the response signals generated 
by the peripheral components 122', 124' and 1 26*. Also, 
the sub -controllers 1 58' of the second embodiment com- 
pare the current number of queued tasks 152' to the 
maximum number of queued tasks 144', instead of mak- 
ing the comparison at the central controller 148 of the 
first embodiment. 

[0031] Referring to FIGURE 2, there is illustrated a 
block diagram of the computer system 100' incorporat- 
ing the second embodiment of the load balancing sys- 
tem 50'. For clarity, only the differences between the first 
embodiment and the second embodiment of the present 
invention will be described. As such, reference is made 
to the central controller 1 48' and the peripheral compo- 
nents 122', 124' and 126' for the following discussion. 
[0032] Each peripheral queue 146' of the peripheral 
components 1 22\ 1 24' and 1 26' includes a level priority 
mark 1 23\ The level priority mark 1 23' is generally below 
the maximum number of queued tasks 1 44 (first embod- 
iment). 

[0033] Upon receiving the request command (e.g., 
multicast signal) from the central controller 148', each 
of the sub-controllers 1 58' compares the current number 
of queued tasks 152' to the level priority mark 123' to 
determine if the corresponding peripheral components 
122', 124' and 126' are ready to service the task. As- 
suming, of course, the peripheral components 1 22', 1 24* 
and 126* are configured and capable of servicing the 
task in the first place. At this point, the peripheral com- 
ponents 1 22', 1 24* and 1 26 do not know the specific de- 
tails about the task that will be dispatched. 
[0034] If the current number of queued tasks 152' is 
less than the corresponding level priority mark 123', 
then the particular sub-controller 158' (e.g., first and 
second peripheral components 124' and 126') will par- 
ticipate in the selection process by transmitting a high 
signal V on the respective data line (e.g., data lines 
(dO) and (d1 )). On the other hand, if the current number 
of queued tasks 152' is greater than or equal to the cor- 
responding level priority mark 123', then the particular 
sub-controller 158' (e.g., third peripheral component 
1 26') will refuse to participate by generally transmitting 
a low signal "0" on the respective data line (e.g., data 
line (d2)). Of course, the level priority mark 123' may be 
set at a different value for each of the peripheral com- 
ponents 122', 124' and 126', and the high "1" signal and 
low "0" signal may be interchanged. 
[0035] In the second embodiment, the central control- 
ler 148' no longer stores the current number of queued 
tasks 152 and the maximum number of queued tasks 
144 associated with the first embodiment. Instead, the 
configuration table 150' of the central controller 148' 



stores the high "1 " signal and/or low "0" signals gener- 
ated by the sub-controllers 1 58' in addition to storing the 
bit pattern 154'. 

[0036] The central controller 1 48' in processing th re- 

s sponse signals dispatch the service command signal to 
one of the capable peripheral components (e.g., first 
and second peripheral component 124' and ^^trans- 
mitting the high T signal in response to the request 
command. This is where the central controller 148 

10 checks the task vs. the task capability registers of all the 
peripheral components 122', 124' or 126* reporting a 
high "1 " signal to select the one peripheral component 
to service the task. Generally, the central controller 148' 
uses a round robin scheme to dispatch the service com- 

15 mand signal to the one capable peripheral component 
122', 124' or 126'. For example, the round robin scheme 
is one where the central controller 148* distributes a first 
task to the first peripheral component 122' and then a 
second task to the second peripheral component 124', 

20 provided both peripheral components are ready and ca- 
pable of servicing either task. 

[0037] Referring to FIGURE 3, there is illustrated a 
signaling diagram showing a run-time operation on the 
side-band bus 1 49 for either embodiment of the load bal- 

25 ancing system 50. As described earlier, the side -band 
bus 149 includes the data lines (d0-d3), the control 
(CTRL) line and the clock (CLK) line on which transpires 
the communications (e.g. : request, response and serv- 
ice command signals) between the central controller 

30 148 and the sub-controllers 1 58. 

[0038] Initially at stage 300, the central controller 1 48 
transmits the request command on the control lin 
(CTRL) and data lines (d0-d3) for soliciting the status of 
the peripheral queues 146 from the capable peripheral 

35 components 122, 124 and 126. At stage 302, a turn 
around period occurs while the sub-controllers 158 
process the request command. 
[0039] At stage 304, the peripheral components 122, 
124 and 126 having unique RIDs 142 with values of 

40 "0-3* responds on the assigned data lines (d0-d3). A 
fourth peripheral component having a RID 142 with a 
value of "3" is not shown. In the first embodiment, for 
example, the first peripheral component 122 will trans- 
mit the response signal 147 in binary form "011" on the 

45 data line (dO). In the second embodiment, the first pe- 
ripheral component 122' will transmit the response sig- 
nal 147* having a high T signal on the data line (dO). 
The second and third peripheral components 124 and 
126 of either embodiment also transmit the correspond- 

50 jng response signals 147 on data lines (d1 and d2). 
[0040] At stage 306, another turn around period oc- 
curs while the central controller 148 processes all of the 
response signals 147. Then at stage 308, if required, 
the peripheral components (not shown) having uniqu 

ss R| Ds 1 42 with values "4-7" respond to the request com- 
mand on the data lines (d0-d3). Depending on th total 
number of the peripheral components connected to the 
side-band bus 149. there may be any number of the 
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stages 306 and 308, thus permitting h sen-able protocol. 
Also, the total number of data lines may be increased 
(e.g., data hnesd0-d8) to enhance the scalable protocol. 
[0041] After all of the peripheral components 122, 124 
and 126 respond to the request command by generating 
and transmitting the response signals 1 47, then at stage 
310, the central controller 148 dispatches the service 
command signal on the control (CTRL) line to one of the 
capable peripheral components 122, 124 or 126. The 
service command signal will direct the one capable pe- 
ripheral component 1 22, 1 24 or 1 26 to service the task. 
[0042] Referring to FIGURE 4, there is illustrated a 
simplified flow diagram of an exemplary method of op- 
eration for the load balancing system 50. As discussed 
earlier, the operation of the two embodiments of the load 
balancing system 50 may be performed within the com- 
puter system 100 based on the intelligent I20 specifica- 
tion or within any computer system having multiple I/O 
subsystems. 

[0043] Beginning at step 402 and upon initialization, 
the PCI bus enumerator which may be part of the basic 
input/output system (BIOS) 1 38 interrogates and scans 
the PCI configuration register 140 located within each 
of the peripheral components 122, 124 and 126 to de- 
termine which of the peripheral components are config- 
ured to participate in the load balancing scheme in the 
first place. Each of the configured peripheral compo- 
nents 122, 124 and 126 are assigned the unique re- 
sponse identifier (RID) 142. 

[0044] At step 404, one of the CPUs 1 02 will transmit 
the task to the central controller 148. 
[0045] At step 406, the central controller 1 48 deter- 
mines which of the peripheral components 1 22, 1 24 and 
126 are capable of servicing the task by broadcasting 
the request command or multicast signal to the capable 
peripheral components 122, 124 or 126. 
[0046] At step 408, each of the capable peripheral 
components 1 22, 1 24 or 1 26 in response to the request 
command generates and transmits the corresponding 
response signal 1 47 to the central controller 1 48. As de- 
scribed earlier, the response signal 147 may be the cur- 
rent number of queued tasks 152 (first embodiment) or 
a high/low signal (second embodiment). 
[0047] At step 409, the central controller 148 then 
masks out ail of the peripheral components that are not 
capable of servicing the task (e.g., registered to receive 
dispatch for the specific task). 

[0048] At step 41 0, the central controller 1 48 upon re- 
ceiving and processing the response signals 1 47 arbi- 
trates and dispatches the sen/ice command signal to the 
one capable peripheral component 122, 124 or 126, as 
described above. 

[0049] At step 41 2, the one capable peripheral com- 
ponent 122, 124 and 126 upon receiving the service 
command signal may place the task onto the corre- 
sponding peripheral queue 1 46 until the one capable pe- 
ripheral component is able to service the task. 
[0050] From the foregoing, it can be readily appreci- 



ated by those skilled in the art that the present invention 
provides a load balancing system and method for use 
in a computer system to distribute tasks between multi- 
ple I/O subsystems. Also, the load balancing system as 
5 disclosed may utilize side-band signaling to provide low 
overhead load balancing among dedicated I/O proces- 
sors. 



10 Claims 

1 . A computer system having a load balancing system 
for distributing a task within input/output subsys- 
tems, said computer system comprising: 

1$ 

at least one central processing unit for trans- 
mitting the task; 

a central controller, responsive to the transmit- 
ted task, for generating and transmitting a re- 
20 quest command; 

aside-band bus; 

a plurality of peripheral components, each of 
the peripheral components capable of servicing 
the request command transmits a response 
25 signal over said side-band bus to said central 

controller; and 

said central controller, responsive to the trans- 
mitted response signals, for dispatching a serv- 
ice command signal to one of the plurality of 
30 peripheral components, where said one periph- 

eral component in response to the sen/ice com- 
mand signal services the task. 

2. The computer system of Claim 1 , wherein each re- 
35 sponse signal includes a current number of queued 

tasks. 

3. The computer system of Claim 1 , wherein the re- 
quest command includes a multicast signal. 

40 

4. The computer system of Claim 1 , wherein each cen- 
tral processing unit includes a basic input/output 
system for interrogating the plurality of peripheral 
components to determine which of the peripheral 

45 components are configured for load balancing, and 
for assigning a unique identifier to each of the con- 
figured peripheral components. 

5. The computer system of Claim 1 , wherein said cen- 
50 tral controller includes a processor for masking out 

all of the peripheral components not registered to 
receive the request command to determine which 
of the plurality of peripheral components are capa- 
ble of servicing the request command. 

55 

6. The computer system of Claim 5, wherein said cen- 
tral controller further includes: 
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a configuration table for receiving and storing a 
current number of queued tasks and a maxi- 
mum number of queued tasks for each capabte 
peripheral component: and 
said processor for determining the one of the 5 
plurality of peripheral components by compar- 
ing the current number of queued tasks to the 
maximum number of queued tasks for each ca- 
pable peripheral component. 

10 

7. The computer system of Claim 6, wherein said con- 
figuration table further includes a configuration 
space containing a maximum number of queueable 
tasks register, 

15 

8. The computer system of Claim 6, wherein said con- 
figuration table further includes a configuration 
space containing a current number of tasks register. 

9. The computer system of Claim 6, wherein said con- 20 
figuration table further includes a task capability 
register. 

1 0. The computer system of Claim 4, wherein each con- 
figured peripheral component includes a peripheral 25 
queue for indicating a maximum number of queued 
tasks using a level priority mark and for storing a 
current number of queued tasks, where if the cur- 
rent number of queued tasks is less than the max- 
imum number of queued tasks then the respective 30 
configured peripheral component transmits the re- 
sponse signal having a predetermined level over 
said side-band bus to said centra! controller. 

1 1 . The computer system of Claim 4, wherein each con- 35 
figured peripheral component includes a configura- 
tion space. 

12. The computer system of Claim 11, wherein said 
configuration space includes a task capability reg- 40 
ister. 

13. The computer system of Claim 11, wherein said 
configuration space includes a maximum queue 
depth register. 45 

14. The computer system of Claim 11, wherein said 
configuration space includes a current queue depth 
register. 

50 

15. A computer system used in a computer for distrib- 
uting a task, said computer system comprising: 

a central controller, responsive to the task, for 
generating and transmitting a service com- 55 
mand signal for servicing the task; 
a side-band bus; 

a plurality of capable peripheral components 



coupled by the side-band bus to the central 
controller; and 

said central controller includes a processor for 
determining which one of the capable peripher- 
al components has a least number of current 
tasks to be serviced by comparing a current 
number of queued tasks to a maximum number 
of queued tasks associated with each capable 
peripheral component, where said one of the 
capable peripheral components receives the 
service command signal. 

16. The computer system of Claim 15, wherein said 
central controller further includes a configuration ta- 
ble for storing the current number of queued tasks, 
the maximum number of queued tasks, and a bit 
pattern representative of a class of the plurality of 
tasks each capable peripheral components is capa- 
ble of servicing. 

17. The computer system of any of Claims 1 to 16, 
wherein said central controller further includes a 
central queue for storing the task until anyone of the 
capable peripheral components has the current 
number of queued tasks less than the maximum 
number of queued tasks. 

18. The computer system of any of Claims 1 to 17, 
wherein the side-band bus includes at least one da- 
ta line, a control line and a clock line. 

1 9. The computer system of Claim 1 8, wherein the side- 
band bus further includes a predetermined number 
of pins for permitting a scalable protocol. 

20. The computer system of Claim 1 or claim 1 5, where- 
in each configured or capable peripheral compo- 
nent includes a sub-controller coupled to the side- 
band bus, said sub-controller includes a queue 
counter connected to a peripheral queue for storing 
the current number of queued tasks. 

21. A method for distributing a task within Input/Output 
subsystems of a computer system, said method 
comprising the steps of: 

broadcasting a multicast signal to a plurality of 
peripheral components that are capable of 
servicing the task; 

generating and transmitting, responsive to the 
multicast signal, a response signal from each 
of the plurality of capable peripheral compo- 
nents; and 

dispatching, responsive to the transmitted re- 
sponse signals, a service command signal for 
servicing the task to one of the plurality of ca- 
pable peripheral components. 
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22. The method of Claim 21, further comprising the 
steps of: 

interrogating the plurality of peripheral compo- 
nents to determine which of the plurality of pe- 5 
ripheral components are configured for load 
balancing; 

assigning a unique identifier to each of the con- 
figured peripheral components; and 
obtaining a maximum number of queued tasks 10 
from each of the configured peripheral compo- 
nents. 

23. The method of Claim 21 : wherein the step of dis- 
patching includes transmitting the service com- 15 
mand signal to said one of the plurality of capable 
peripheral components having a least number of 
current tasks to be serviced. 

24. The method of Claim 23, wherein the step of trans- 20 
mitting the service command signal further in- 
cludes: 

receiving and storing a current number of 
queued tasks and ' a maximum number of 2s 
queued tasks for each capable peripheral com- 
ponent; and 

determining the one of the plurality of capable 
peripheral components having the least 
number of current tasks to be serviced by com- 30 
paring the current number of queued tasks to 
the maximum number of queued tasks for each 
capable peripheral component. 

25. The method of Claim 21, wherein said step of 35 
broadcasting further includes masking out all of the 
peripheral components not registered to receive the 
task to determine the plurality of capable peripheral 
components. 

26. The method of Claim 21, wherein the step of gen- 
erating and transmitting a response signal includes 
comparing within each capable peripheral compo- 
nent a current number of queued tasks to a level 
priority mark indicative of a maximum number of 
queued tasks, where if the current number of 
queued tasks is less than the maximum number of 
queued tasks then the corresponding capable pe- 
ripheral component will transmit the response sig- 
nal having a predetermined level. 50 

27. The method of Claim 21 , wherein the step of dis- 
patching further includes delaying the transmission 
of the s rvice command signal until anyone of the 
capable peripheral components has the current 55 
number of queued tasks less than the maximum 
number ol queued tasks. 
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Initialize the load balancing system: 
-interrogate the peripheral components; ond 
-Assign unique RID numbers to 
configured peripheral components 



402 



Tronsmit the task to the 
centrol controller 



404 



Broodcast the multicast signal 



406 



Generote/Tronsmit the response signals 
from the capable peripheral controller(s) 
to the central controller 



408 



Mask out all of the peripheral components 
not capable of servicing the task 



409 



Dispotch the service command signal to 
the one capable peripheral component 



410 



Service the tosk 



412 



FIG. 4 
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